Combining Synthetic and Observed Data to Enhance Machine Learning Model Performance for Streamflow Prediction
نویسندگان
چکیده
Machine learning (ML) models have been shown to be valuable tools employed for streamflow prediction, reporting considerable accuracy and demonstrating their potential part of early warning systems mitigate flood impacts. However, one the main drawbacks these is low precision high values extrapolation, which are precisely ones related floods. Moreover, great majority evaluated considering all data equally relevant, regardless imbalanced nature records, where proportion small but most important. Consequently, this study tackles issues by adding synthetic observed training set a regression-enhanced random forest model increase number introduce extrapolated cases. The generated with physically based Iber precipitations different return periods. To contrast results, compared only fed data. performance evaluation primarily focused on using scalar errors, graphically errors event, taking into account precision, over- underestimation, cost-sensitivity analysis. results show improvement in trained combination respect observed-data regarding values, root mean squared error percentage bias decrease 23.1% 38.7%, respectively, larger than three years period. utility increases 10.5%. suggest that addition precipitation events existing records might lead further improvements models.
منابع مشابه
Improving the Performance of Machine Learning Algorithms for Heart Disease Diagnosis by Optimizing Data and Features
Heart is one of the most important members of the body, and heart disease is the major cause of death in the world and Iran. This is why the early/on time diagnosis is one of the significant basics for preventing and reducing deaths of this disease. So far, many studies have been done on heart disease with the aim of prediction, diagnosis, and treatment. However, most of them have been mostly f...
متن کاملImpacts of combining meteorological and hydrometric data on the accuracy of streamflow modeling
متن کامل
Machine Learning Models for Housing Prices Forecasting using Registration Data
This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...
متن کاملCombining Data Mining and Machine Learning for Effective Fraud Detection
This paper describes the automatic design of methods for detecting fraudulent behavior. Much of the design is accomplished using a series of machine learning methods. In particular, we combine data mining and constructive induction with more standard machine learning techniques to design methods for detecting fraudulent usage of cellular telephones based on profiling customer behavior. Specific...
متن کاملCombining Data Mining and Machine Learning for Effective User Profiling
This paper describes the automatic design of methods for detecting fraudulent behavior. Much of the de&,, ic nrrnm,-,li~h~rl ,,&,a n .am.L~ nf mn.-h;na lm..~:~~ e-. .. ..--..*.*yYYA’“.. UY.“b Y UISLUY “I III-Yllr IxuIY11~ methods. In particular, we combine data mining and constructive induction with more standard machine learning techniques to design methods for detecting fraudulent usage of ce...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Water
سال: 2023
ISSN: ['2073-4441']
DOI: https://doi.org/10.3390/w15112020